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Abstract. Let G be an Eulerian directed graph with an arc-labeling 
such that arcs going out from the same vertex have different labels. 
In this work, we present an algorithm to construct the Eulerian trail 
starting at an arbitrary vertex v of minimum lexicographical label among 
labels of all Eulerian trails starting at this vertex. 

We also show an application of this algorithm to construct the mini- 
mal de Bruijn sequence of a language. 



1. Introduction 

Eulerian graphs were an important concept in the beginning of the graph 
theory. The "Konigsberg bridge problem" and its solution given by Euler in 
1736 are considered the first paper of what is nowadays called graph theory. 

In this work, we consider graphs with an arc-labeling with the follow- 
ing property: Arcs going out from the same vertex have different labels. 
These graphs are commonly utilized in the automata theory: a labeled di- 
graph represents deterministic automata where vertices are the states of the 
automata, and arcs represent the transitions from one state to another, de- 
pending on the label of the arc. Eulerian trails over these graphs are related 
with synchronization of automata (see [?]). 

Eulerian graphs with this kind of labeling are also used in the study of 
DNA. By DNA sequencing we can obtain fragments of DNA which need 
to be assembled in the correct way. To solve this problem, we can simply 
construct a DNA graphs (see [?]) and find an Eulerian trail over this graph. 
This strategy is already implemented and it is now one of the more promising 
algorithms for DNA sequencing (see [?, ?]). 

To find the Eulerian trail of minimal label is also interesting with respect 
to the problem of finding of optimal encoding for DRAM address bus. In 
this model, an address space of size 2^" is represented as labels of edges in a 
complete graph with 2" vertices. An Eulerian trail over this graph produces 
an optimal multiplexed code (see [?]). If we want to give priority to some 
address in particular, the Eulerian cycle of minimal label give us this code. 

Another interesting application of these graphs is to find de Bruijn se- 
quences of a language. De Bruijn sequences are also known as "shift register 
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sequences" and were originally studied in [?] by N. G. De Bruijn for the 
binary alphabet. These sequences have many different applications, such 
as memory wheels in computers and other technological device, network 
models, DNA algorithms, pseudo-random number generation and modern 
public-key cryptographic schemes, to mention a few (see [?, ?, ?]). More 
details about this application are discussed in Section 3. 

By the BEST theorem (see [?]), we can compute the number of Eulerian 
trails in a graph. This number is usually exponential in the number of 
vertices of the graph (at least ((7 — where V is the set of vertices and 

7 is the minimum degree of vertices inV). Therefore, finding the Eulerian 
trail of lexicographically minimum label can be costly. 

In this work, we give an algorithm to construct the Eulerian trail of min- 
imum label starting at a given vertex. The complexity of the algorithm is 
linear in the number of arcs of the graph. In Section 2 we give some defini- 
tions to understand the problem and we prove the main theorem. Finally, in 
Section 3 we give an application of this algorithm to construct the minimal 
de Bruijn sequence of a language. 

2. Main Theorem 

Let G be a digraph and let / : A{G) ^ N he a labeling of the arcs of 
G over an alphabet N such that arcs going out from the same vertex have 
different labels. 

A trail is an alternating sequence W = viaiV2a2 ■ ■ ■ "^k-i^^k-i^k of vertices 
Vi and arcs aj such that the tail of Oj is Vi and the head of is Vi^i for 
every i = 1,2, . . . ,k — 1 and all arcs are distinct. If vi = then I^ is a 
closed trail. A closed trail is an Eulerian trail if the arcs of W are all the 
arcs of G. An Eulerian graph is a graph with an Eulerian trail. The label 
of W is the word l{ai) . . . l{ak-i)- 

Given a strongly connected Eulerian digraph and a vertex r, we show 
how to find the Eulerian trail starting in r with the minimal lexicographical 
label. Remark that is important to fix a starting vertex r so at to define an 
order in which vertices are visited, which allow us to define a lexicographical 
order among Eulerian trails. 

Let U he a subset of vertices in G. A cut is the set of arcs with one 
end in U and the other in V \ U, and is denoted by 5g{U). A vertex v is 
exhausted by a trail W if (5g\A(iv)(^) = 0- The set of vertices exhausted by 
W is denoted by S{W). 

Lemma 1. Let U, B he subsets of vertices and let T be the trail starting in 
r of minimum label exhausting U. If U C. B C. S{T), then T is the trail of 

minimum label exhausting B. 



Proof. Let T' he a trail starting in r exhausting B with a smaller label 
than T. Since U C. B then T' exhausts U. Hence, the label of T is not 
minimal. □ 
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A trail W can visit a vertex v many times. We decompose a trail W in 
the sub-trails Wv and vW, where Wv is the sub-trail of W finishing in the 
last visit of v, and vW is the sub-trail of W starting from the last visit of v. 
We denote vW the trail vW without the first vertex v but containing the 
first arc of vW. 

Lemma 2. Let T be a closed trail starting in r such that r € S{T). Let v 
be the last vertex not belonging to S{T) visited by T. If w is its next vertex 
in T then 

^G\A{Tv)ivT) = {vw} 

Proof. Let xy be an arc of dc{vT). Since all vertices of vT are exhausted 
by T, xy G ^(T). Hence either xy G A(Tv) or xy G A{vT). Therefore 

xy G Sg\A{Tv){'vT) if and only if xy = vw. □ 

We define the following strategy to construct a trail: Starting at a given 
vertex v, follow the unvisited arc (if exists) of minimal label. This strategy 
finishes with a closed trail, and this trail exhausts the vertex v. A trail 
constructed by this strategy is called an alphabetic trail starting at v and is 
denoted by W{G,v). By definition, an alphabetic trail starting at v is the 
trail of minimal label among all trails starting at v and exhausting v. 

Let vhe a vertex and let T be the closed trail of minimal label exhausting 
all vertices in vT. We find the trail of minimal label exhausting all vertices 
in vT. If v G S{T) then by Lemma 1 the trail T is the solution to this 
problem. If v ^ S{T) then the next lemma give us the solution: we need to 
split T and insert the alphabetic trail over G\A{T) starting at v. Repeating 
this process we finish with the Eulerian trail of minimal label. 

Lemma 3. Let T be a closed trail starting and exhausting r such that if v is 
the last vertex in V\S{T) visited by T then T is the closed trail of minimum 
label exhausting vT. 

Let Z be the closed trail of minimum label in exhausting vT and let W = 
WiG\A{T),v). Then Z = {Tv)W{vT). 

Proof. By supposition, T is the closed trail of minimum label exhausting 
It and It C S{Z), hence 1{Z) > 1{T). In particular, 1{Z) > l{Tv). Also 
Z and iTv)W{vT) exhausts vT. Hence 1{Z) < l{(Tv)W{vT)), concluding 
that Z = {Tv)Z'. 

By Lemma 2 the only way to visit vertices in vT is using the arc vw, and 
vT is the trail of minimum label exhausting V(yT) in G\ {A{Tv)). Since Z 
is a closed trail of minimum label, Z = {Tv)Z" {vT). 

Finally, Z" is a closed trail of minimum label in G \ A{T) exhausting -y, 
therefore Z" = W. □ 

Theorem 1. Algorithm 1 finishes with an Eulerian trail starting in r and 
its label is the minimal one among all Eulerian trails starting in r. 
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Algorithm 1 Compute the minimal Eulerian trail starting in r 

V ^NoEx(r) {v = r} 
while V / NULL do 

W ^ W{G \ A{T), f;).over G \ A{T). 
{Tv)W{vT) 

V ^ NoEx(r). 
end while 

Where NoEx(r) returns the last non-exhausted vertex visited by T or 
NULL if this vertex does not exist. 



Proof. At each repetition of the "while", the trail T exhausts at least one 
vertex non-exhausted in the previous step, so the algorithm finishes in a 
finite number of steps. 

We define inductively = G\A{T'-'^), =NoEx(rj), = W{G\v') 
and = {T'-^v^-'^)W'{v^-^T'-'^), with Tq = 0. 

Wc prove by induction that T* is the closed trail of minimal label ex- 
hausting V T\ For i = 1, = W{G, r) is by definition the closed trail of 
minimal label exhausting r, and by Lemma 1 it is the trail of minimal label 

exhausting v T . Let be the closed trail of minimum label exhausting 

oi— 1 . , . -J 

V . Applying Lemma 3 to T*~ , we conclude that T* is the closed 

trail of minimal label exhausting v^~^T^ and by Lemma 1 it is the minimal 

closed trail exhausting v T*. 

Therefore the algorithm finishes with a closed trail T exhausting all its 
vertices V{T), but G has only one strongly connected component, thus 
A{T) = A{G). We conclude that T is an Eulerian trail of minimal label. 

□ □ 

We can use the following structure to represent the graph, a list of size \ V\ 
representing vertices where each element v in the list has a stack with the 
head of each arc starting at v in order. Knowing this structure of a graph, the 
algorithm can easily construct the trails W{-, •), removing the visited nodes 
from the stack and keeping track of exhausted vertices. Since this algorithm 
visits each arc at most twice, it can be implemented in 0(1^(0)1), which is 
best possible. 

Remark that while the initial vertex r can be arbitrarily chosen, different 
initial vertices can produce different trails, even if we consider the label as 
a circular string. For example, in the graph of Figure 1, the minimal de 
Bruijn sequence starting at u is 001122 but starting at v is 100122. 

3. An APPLICATION: MINIMAL DE BrUIJN SEQUENCE 

Given a set V of words of length n, a de Bruijn sequence of span n is 
a periodic sequence B such that every word in V (and no other n-tuple) 
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Figure 1 



appears exactly once in B. Historically, de Bruijn sequence was studied in 
an arbitrary alphabet considering the language of all the n-tuples. In [?] the 
concept of de Bruijn sequences was generalized to restricted languages with 
a finite set of forbidden substrings and it was proved the existence of these 
sequences and presented an algorithm to generate one of them. Nevertheless, 
it remained to find the minimal de Bruijn sequence in this general case. 

In [?] was studied some particular cases where it is possible to obtain 
efficiently the minimal de Bruijn sequence. Using our previous algorithm we 
can solve this problem efficiently in all cases. 

A word p is said to be a factor of a word w if there exist words u,v e N* 
such that w = upv. If u is the empty word (denoted by e), then p is called 
a prefix of w, and if v is empty then is called a suffix of w. 

Let P be a set of words of length n + 1. We call this set a dictionary. A 
de Bruijn sequence of span n + 1 for "D is a (circular) word of length 

\T>\ such that all the words in V are factors of In other words, 

{{B'^'^+% . . . (i?^'"+i),+„ ^od (n+1) \i = 0...\V\} = V 

De Bruijn sequences are closely related to de Bruijn graphs. The de Bruijn 
graph of span n, denoted by G^'^\ is the directed graph with vertex set 

V{G^'"') = {u & N"'\u is a prefix or a suffix of a word in P} 

and arc set 

A^Q'D,n^ = {{av,v/3)\a,(3 G N,av(3 G V} 

Note that the original definitions of de Bruijn sequences and de Bruijn 
graph given in [?] are the particular case of V = N^^^. 

We label the arcs of the graph G^'" using the following function I: if 
e = {au,u(3) then l{e) = j3. This labeling has an interesting property: Let 
T = vqCo ■ ■ ■ CrnVm+i be a trail over G'^''^ of length m > n. Then T finishes in 
a vertex u if and only if is a suffix of 1{T) = /(eg) . . . l{em)- This property 
explains the relation between de Bruijn graphs and de Bruijn sequence: 
QV,n+i jg ^YiQ label of an Eulerian trail of G^'". Therefore, given a dictionary 
T>, the existence of a de Bruijn sequence of span n + 1 is characterized by 
the existence of an Eulerian trail over G^'"". 

Let D he a dictionary such that G^'" is an Eulerian graph. Let z be the 
vertex of minimum label among all vertices. Clearly, the minimal de Bruijn 
sequence has z as prefix. Hence, the minimal Eulerian trail over G^'" starts 
at an (unknown) vertex and after n steps it arrives to z. Therefore if we 
start our Algorithm 1 in the vertex z we obtain the Eulerian trail of minimal 
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label starting at z which have label B = B' ■ z. Hence z ■ B' is the minimal 
de Bruijn sequence of span n + 1 for P. 
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